Goto

Collaborating Authors

 Thiès


Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal

arXiv.org Artificial Intelligence

This work is part of the Kallaama project, whose objective is to produce and disseminate national languages corpora for speech technologies developments, in the field of agriculture. Except for Wolof, which benefits from some language data for natural language processing, national languages of Senegal are largely ignored by language technology providers. However, such technologies are keys to the protection, promotion and teaching of these languages. Kallaama focuses on the 3 main spoken languages by Senegalese people: Wolof, Pulaar and Sereer. These languages are widely spoken by the population, with around 10 million of native Senegalese speakers, not to mention those outside the country. However, they remain under-resourced in terms of machine-readable data that can be used for automatic processing and language technologies, all the more so in the agricultural sector. We release a transcribed speech dataset containing 125 hours of recordings, about agriculture, in each of the above-mentioned languages. These resources are specifically designed for Automatic Speech Recognition purpose, including traditional approaches. To build such technologies, we provide textual corpora in Wolof and Pulaar, and a pronunciation lexicon containing 49,132 entries from the Wolof dataset.


Humans Beat Deep Networks at Recognizing Objects in Unusual Poses, Given Enough Time

arXiv.org Artificial Intelligence

Deep learning is closing the gap with humans on several object recognition benchmarks. Here we investigate this gap in the context of challenging images where objects are seen from unusual viewpoints. We find that humans excel at recognizing objects in unusual poses, in contrast with state-of-the-art pretrained networks (EfficientNet, SWAG, ViT, SWIN, BEiT, ConvNext) which are systematically brittle in this condition. Remarkably, as we limit image exposure time, human performance degrades to the level of deep networks, suggesting that additional mental processes (requiring additional time) take place when humans identify objects in unusual poses. Finally, our analysis of error patterns of humans vs. networks reveals that even time-limited humans are dissimilar to feed-forward deep networks. We conclude that more work is needed to bring computer vision systems to the level of robustness of the human visual system. Understanding the nature of the mental processes taking place during extra viewing time may be key to attain such robustness.


Analysis of COVID-19 evolution in Senegal: impact of health care capacity

arXiv.org Machine Learning

We consider a compartmental model from which we incorporate a time-dependent health care capacity having a logistic growth. This allows us to take into account the Senegalese authorities response in anticipating the growing number of infected cases. We highlight the importance of anticipation and timing to avoid overwhelming that could impact considerably the treatment of patients and the well-being of health care workers. A condition, depending on the health care capacity and the flux of new hospitalized individuals, to avoid possible overwhelming is provided. We also use machine learning approach to project forward the cumulative number of cases from March 02, 2020, until 1st December, 2020.